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Abstract. This note concerns a one-line diagrammatic proof of the Cayley- 
Hamilton Theorem. We discuss the proof's implications regarding the "core 
truth" of the theorem, and provide a generalization. We review the notation 
of trace diagrams and exhibit explicit diagrammatic descriptions of the coef- 
ficients of the characteristic polynomial, which occur as the n + 1 "simplest" 
trace diagrams. We close with a discussion of diagrammatic polarization re- 
lated to the theorem. 
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1. Introduction 

Given annxn matrix A, p(X) = det(A — XI) is a degree n polynomial in A called 
the characteristic polynomial of A, whose roots are the eigenvalues of the matrix. 
The Cayley-Hamilton theorem says that a matrix satisfies its own characteristic 
equation p{A) = 0. Several proofs of this result have been given, including recent 
approaches via power series pQ and adjugates [5]. 

The purpose of this paper is to explain and generalize the following one-line 
proof of the Cayley-Hamilton Theorem, which appears in Section 6.5 of 0]: 

Theorem 1 (Diagrammatic Cayley-Hamilton theorem). Let (pj3^i represent the 



antisymmetrizer on n + 1 vectors in C™ . If A is an n x n matrix, then 

(i) Uu-iJ =n. 



Proof. Since n+1 vectors in C" are linearly dependent, it follows that E S = 0. □ 

Later sections will explain the notation used in the above theorem, and par- 
ticularly why |T]) is the matrix polynomial equation given in the Cayley-Hamilton 
Theorem (Proposition [T8| . The proof makes it clear that, from this point-of-view, 
the Cayley-Hamilton Theorem is a direct consequence of the dimension of the vec- 
tor space, which leads to linear dependence. Moreover, one is immediately able to 
generalize the diagrammatic equation to the following, whose proof is just as quick: 

l 
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Theorem 2. Let Ai, A%, . . . , A n be n x n matrices. Then 




The diagrammatic exposition here differs substantially from j4] , where TheoremQ] 
is also proven. The additional contribution here is Theorem [2] and the procedure for 
generalizing such identities. In addition, trace diagrams provide a more accessible 
language for describing diagrammatic formulas. 

This paper is organized as follows. In Section [2l we review some background in 
multilinear algebra and signed graph colorings. In Section [3l we define trace dia- 
grams and give some of their basic properties. The explanation of the diagrammatic 
Cay ley-Hamilton Theorem is given in Section [4j and we conclude with a discussion 
of identity polarization in Section [5] 



2. Preliminaries 

2.1. Multilinear Algebra. This section introduces our notation for multilinear 
algebra and tensors. A nice introductory treatment of tensors is given in Appendix 
B of [8]. 

Let V be a finite-dimensional vector space over a field F. In what follows, we 
assume that V has basis {ei, e 2 , . . . , e„}. The space of /c-tensors V® k = V® ■ ■ ■ ®V 
is itself a vector space with n k basis elements of the form 

&a = £*) e a2 • * • ^ctfc ; 

one for each a = (ai, cx%, . . . , aifc) e A^. By convention V®° — F. Denote by 
Fun(y lg,J , V® k ) the space of multilinear functions from V®- 7 to V® k . The tensor 
operation (/i, f2) <— > /i <8> fi on functions makes the space a monoidal category. 

2.2. Signed Graph Coloring. This section introduces graph theoretic principles 
that will be used in defining trace diagrammatic functions. The basic ideas con- 
tained here were first given in [12j . and further developed in [10] . 

Given a vertex v £ V of a graph G = (V,E), denote by E(v) the set of edges 
adjacent to v. We say that two edges in the same E(v) for some v are adjacent. 

Definition 3. A ciliated graph G — (V,E,a*) is a graph (V, E) together with an 
ordering a v : {1,2,..., deg(u)} — > E(v) of edges at each vertex v e V. 

By convention, when such graphs are drawn in the plane, the ordering is specified 
by enumerating edges in a counter-clockwise fashion from a ciliation, as shown in 
Figure [T] Ciliated graphs are sometimes also called fat graphs. 

Definition 4. Given the set N — {1,2, ... ,n}, an n-edge coloring of a graph 
G = (V, E) is a map k : E — > N such that no two adjacent edges have the same 
label. We denote the set of all n-edge colorings of a graph G by col(G) when n is 
understood. 

In graph theory, edge colorings are sometimes called Tait colorings. 



ON A DIAGRAMMATIC PROOF OF THE CAYLEY-HAMILTON THEOREM 



3 



Edge colorings induce permutations at the vertices of ciliated graphs. Given 
an edge coloring k and a degree- n vertex v, there is a well-defined permutation 
7r K (w) G S n at each interior vertex v G V n defined by 

n K (v) : i i ^ n(a v (i)). 

In other words, 1 is taken to the label on the first edge adjacent to the vertex, 2 is 
taken to the label on the second edge, and so on, as in Figure Q] 




Figure 1. By convention, a ciliation on a vertex induces a 
counter-clockwise ordering, shown as (ei, e%, e^, e^) at left. The 
coloring shown at right induces the permutation (J 2 3 4) a ^ the 
vertex. 



Definition 5. Given an admissible coloring k of a ciliated graph G = (V,E,a*), 
the signature sgn K (G) is the product of permutation signatures on the degree- n 
vertices: 

sgn K (G) = n Sgn(7r K (»), 
where sgn(7r K (i;)) is the signature of the permutation ir K (y). 

Definition 6. A pre-coloring of a graph G = (E, V) is a coloring k : E — > A of 
a subset E C E of the edges of G. A leaf coloring is a pre-coloring of the edges 
adjacent to the degree-1 vertices. 

Given edge sets E\ C E2, if the colorings k\ : E% — > N and k^ : E2 — > A" agree 
on we say that ki extends k\ and write ki >- Ki- 

Example. For n = 3, the pre-coloring Vr^ has two colorings 

1^2 

A iA 2 

In the first case, the permutations are ( \ \ 3) and ( \ \ \ ), so that the signature is 
s S n (123)' s S n ( 1 2 3 ) = — 1- I n tne second case, the permutations are (III) and 
(312)' indicating a positive signature. 

3. Trace Diagrams 

Penrose was probably the first to describe how tensor algebra may be performed 
diagrammatically [11) . In his framework, lines in a graph represent elements of 
the vector field F, and nodes represent multilinear functions. Trace diagrams are 
a generalization of Penrose's tensor diagrams, in which the edges are directed and 
may be labeled by matrices and the nodes represent the determinant form. The 
closest concept in graph theory is the gain graph, in which edges of a graph are 
marked by group elements. 
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3.1. Definitions. 

Definition 7. A trace diagram is a directed ciliated graph T> = (V\ U V n ,E, cr*) 
together with a marking of edges by linear transformations in Fun(V, V). Vertices 
have cither degree 1 (in Vi) or degree n (in V n ). The diagram is closed if V\ is empty. 
A framed trace diagram is a diagram together with a partition E\ = £7 U of 
the leaf edges adjacent to V\ into ordered inputs Ej and outputs Eo- 

By convention, framed trace diagrams are drawn with inputs at the bottom of 
the diagram and outputs at the top. Both are assumed to be ordered left to right. 

We also permit multiple markings on the same edge, with the understanding 
that 




If there are no markings, then I? is a determinant diagram, and the orientation of 
edges may be omitted. 

Trace diagrams require an expanded definition of coloring: 

Definition 8. A coloring of a trace diagram I? is a map k : E — » N x N labeling 
the head and tail of each edge by n{e)\ and «(e)2, respectively, in such a way that 
all labels near an n-vertex are different, and unmarked edges e have K(e)i = /c(e)2- 




Suppose further that A e represents the matrix marking on the edge e 6 £. Then 
the coefficient ip^^D) of the coloring is 

V>k(£>) = n( Ae ) K ( e )2«(e)i> 

where (A)ij represents the i, j-matrix entry. 

Thus, the coloring "picks out" the entry in the column corresponding to the 
incoming edge and the row corresponding to the outgoing edge: 

(4) S <- (A) ij = {e i ,Ae j ). 

'j 

Pre-colorings ks : (Ej)i 1— > on the input edges are in a one-to-one corre- 
spondence with basis elements eg € V®\ E '\; we will use a as shorthand for the 
pre-coloring. Together with an output edge coloring (3 S y®\ E o\^ the combination 
a U (3 is a leaf coloring. 

Definition 9. Given a framed trace diagram 2?, define the weight Xi(^) °f a l ea f 
coloring 7 by 

(5) X 7(^) = E s S n ^ P )^(^)- 



ON A DIAGRAMMATIC PROOF OF THE CAYLEY-HAMILTON THEOREM 



5 



Define the trace diagram function of T> by linear extension of 



(6) fv ■ e s >-> ^5u^( 2? ) e /3- 



(3eN\ E o\ 

If the diagram is closed, we define its value to be 



X(V)= J2 sgn K (P)^ K (P). 



KGcol(U) 



We will sometimes abuse notation by using the diagram T> interchangeably with 
/d G Fuxl(V®' Ei ' , V®\ E °'). As discussed in the next section, this is permissible 
because the operation preserves the monoidal structure. We also write formal 
linear sums of diagrams to indicate the corresponding sums of functions. 



Example. The diagram j has no vertices, so the signature is trivially +1, and the 
input and output must have the same label. This implies that j : t— ► e,;, verifying 
that the diagram is the identity. 

The two colorings of §5§ describe the action of the underlying diagram on the 
input ei ® e 2 : 



The input ei ® 62 corresponds to the pre-coloring at the bottom of the diagram, 
while each term in the output corresponds to a coloring extension whose coefficient 
is coloring's signature. 

Example. The simplest closed trace diagram with a matrix has edge colorings 



3.2. Trace Diagram Relations and Monoidal Structure. Denote by D(I, O) 
the free -F-module over framed trace diagrams with / inputs and O outputs. One 
may compose elements of 0\) with those of £>(Oi, O2) by gluing outputs to 

inputs. Since inputs are drawn at the bottom of a diagram and outputs at the top, 
composition involves drawing one diagram above another. 

One may also define T>\ ® T>2 as the diagram placing D2 to the right of £>i, 
making the space of framed trace diagrams a monoidal category. 

Theorem 10 (Theorem 5.1 in [10 ). The mapping T> ^ fx> of Definition^ is a 
functor of monoidal categories. 

Intuitively this result means that a trace diagram's function may be understood 
by breaking the diagram up into little pieces and gluing them back together. 

Definition 11. A trace diagram relation is a summation cjfD 6 D(I,0) of 
framed trace diagrams for which ^2 v cx>fv = 0. 

Given the monoidal structure, one can apply trace diagram relations on small 
pieces of larger diagrams (called local relations). Sometimes diagrammatic struc- 
tures are defined as free summations over diagrams modulo one or more local rela- 
tions [3]. 





for i = 1, 2, . . . , n. Thus, the diagram's value is 



(7) 
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Trace diagram relations can be made more general than multilinear relations by 
relaxation of the framing. Denote by D(m) the free F- module over tensor diagrams 
with m ordered leaves. A leaf partition gives a mapping 55 (m) — > D(I,0), where 
I + O = m. A (general) trace diagram relation is a summation ct>T> 6 S5(m) 
that restricts under some leaf partition to a framed trace diagram relation. 

Theorem 12 (Theorem 5.4 in |10j). Every leaf partition of a general trace diagram 
relation gives a framed trace diagram relation. 

Proof. By Definition the weights of a function depend only on the leaf labels, 
and not on the partition of framing of the diagram. Since the weights are the same, 
the relations do not depend on the framing. □ 

The fact that diagrammatic relations are independent of framing is very powerful. 
One may sometimes read off several identities of multilinear algebra from the same 
diagrammatic relation. 

Example. Let u,v,w G C 3 . One can show the cross product and inner product 
to be 



u x v = f \ and u • v = 

ml m [u] [v] 

One can also show that 

x= 

Consequently, 

A~X = n \ \ - i r , . 

[u][v][w][x] [u][v][w][x] ful [yHwllxl 
which is the vector identity 

(u x v) • (w x x) = (u • w)(v • x) — (u • x)(v • w). 

3.3. Symmetrization and Anti-symmetrization. 

Definition 13. Define the anti-symmetrization diagram by 

: e s i — > ^2 s 8' n ( cr )e ( T(5)- 

Note that if k > n, symmetrization maps any element to 0, since there cannot 
be more than n linearly independent elements of M . This is the observation that 
proves Theorems [1] and [2j 

Redundancies in strands adjacent to the same two nodes are captured by the 
following result: 

Proposition 14 (Lemma 6.6 in |10|). Let vx,t>2 G V n be adjacent vertices in a 
trace diagram T> with E = E(v\) C\E(v2), representing their shared edges, such that 
all edges e £ E are marked by the same matrix. Choose a coloring n of D that 
restricts to pre-colorings k on E and k c = k\^ c on E c . Then 

Xk .{V) = \E\\ Xk (V). 

The factorial in this result shows up in the next two propositions. 
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Proposition 15 (Proposition 6.4 in [10]). 



(9) 



(~l) LfJ 
TTT^T (n-k)\ 

Note in particular the case with k = n: 

= (-l)LtJ 




(10) 



Proposition 16 (Determinant Diagram, Corollary 6.5 in [TO]). 



(11) 




c*9 cb =(-^ i n\det(A). 



4. The Not-So-Characteristic Equation 

We now return to Theorem [T] The fundamental question is how this relates to 
the formula det(A — XI) = 0. The following example demonstrates the relationship 
in one case: 

Example 17. For 2x2 matrices, Theorem Q] implies 



6 C 





tr(A) 2 I + A 2 +A 2 - tr(A)A - tr(A)A - tr(A 2 )/ 
2 (A 2 - tr(A)A + det(A)l) = 0, 



where the last step uses the fact that det(A) = 1 (tr(A) 2 — tr(A 2 )). 

As is usually the case in diagrammatic statements, the "hard part" is demon- 
strating the equivalence to the traditional construction. The remainder of this 
section proves the following: 

Proposition 18. When the antisymmetrizer B5pjl in |T]) is expanded, the coeffi- 
cients of A 1 are equal to n\ times the coefficients of X 1 in the characteristic polyno- 
mial det(A - XI) = 0. 

First, how can the coefficients of the characteristic polynomial be described di- 
agrammatically? 

Lemma 19. Given n x n matrices A, B, the determinant sum det(A + B) is ex- 
pressed diagrammatically as 



(12) 



n i 

det(A + B) = (-l)LiJ]r 



i=0 



i\(n — i)\ H 
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Proof. Proposition [16] states that 



(13) 




= (-l)LtVdct(A). 



Replacing A with A + B in this diagram, one obtains 2 n diagrams, each of which has 

the form Ln\'"A. where ? is either A or B. By Proposition 1141 one may rearrange 

the strands of the diagrams so that the ^4's and the -B's are grouped separately, 
without changing the value of the diagram. Therefore, all (?) = dia grams 

that have i strands labeled by B have the same value, and the result follows. □ 

Corollary 20. In terms of diagrams, the characteristic polynomial is 



(14) 



i=0 



(-i)*+m 

i\(n — i)\ 




i=0 



where 



(15) 



(_1)*+LtJ / 
i\(n — i)\ < 4 




This means that the coefficients of the characteristic polynomial are, up to a 
constant factor, the n + 1 "simplest" diagrams with two nodes. Are these also the 
coefficients of A 1 in (JTJ) ? The next lemma provides the combinatorial decomposition 
of the antisymmetrizer that is required to demonstrate this fact. 

Lemma 21. For any k with < k < n, 



(16) 



^ (k-i)\ 

i=0 y ' 




Proof. Choose a summand corresponding to a permutation a £ Sfc+i- Write the 
permutation as a — rv where r is the cycle containing the first element and v 
contains the remaining cycles. Then |t| = i + 1 in the summand, since the left 
strand passes through i strands, and the summand contributes to the A % term. 
There are ppni ways to select r such that |r| = In each case, the remainder 

of the diagram is closed and all choices of v can be consolidated into a single 
term. The sign of a summand is given by sgn(c) = sgn(r)sgn(^) = (— l) l sgn(z^), 
and sgn(^) is also incorporated within the ■ Hence, the coefficient of A 1 is 

(— 1)' ( fc ^ ! times the closed diagram shown. □ 
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Proof of Proposition \7R Letting k — n in Lemma [21] and applying Proposition [15] 
shows that 




^ (-l)'(-l)LfJn! Ct 
77. TTTTi a 



i=0 



(n — i)\i\ 




i=Q 



□ 



5. Conclusion 

We have now explained the notation underlying Theorems [T] and and seen that 
in the first case the diagram translates into what is normally understood as the 
Cayley-Hamilton theorem. One of the beautiful consequences of the proof is that 
the coefficients of the characteristic polynomial, sometimes called the fundamental 
matrix invariants, match up precisely with the n+ 1 "simplest" diagrams involving 
a single n x n matrix: 



tr(A) = .Q 





The alert reader may notice that if n is even, an additional nontrivial diagram may 
be constructed from a single node. 

Conjecture. For a 2n x 2n skew- symmetric matrix A, the Pfaffian Pi(A) corre- 
sponds to 



(17) 



Pf(A) oc 




Theorem[5]relates to the technique of polarization, which transforms a trace iden- 
tity with a single matrix A of degree k into an identity with k matrices Ai , . . . , A^ . 
Formally, the polar version of an identity t{A) = is 



(18) 



1 9 9 ,A A 



+ \ k A k ) = 0. 



Theorem [5] gives the same result as polarization, as can be inferred by the sum 
formulae of Section^ For example, the simple formula A 2 — tr(A)A + det(A)I = 
corresponds to 



0. 



which reads as 





tT(A 1 )tr(A 2 )I + A 2 A X + A X A 2 - tr(A 2 )Ai - tr(A 1 )A 2 - tx{A x A 2 )I = 0. 



10 



ELISHA PETERSON 



It is not hard to verify that this is the polarized version of the original identity 
given by (fl8]) . 

The same process can be applied in other scenarios. As a simple example, it is 
just as easy to prove that 




Upon taking the trace, one obtains 

tr(ABC) + tx(ACB) 

= tr(AB)tr(C) + tr(A)tr(BC) + tr(B)tv(CA) - tr(A)tr(B)tr(C). 

This is sometimes called the Fricke sum relation and features prominently in the 
invariant theory of 2 x 2 matrices [5J |H1 [7J IS] • 
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